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The seven serologically different botulinum nenro- 
toxins are highly potent protein toxins that inhibit 
n orotransmitter release from peripheral cholinergic 
synapses. The activated toxins consist of the toxifying 
A-subunits (Afr = 50,000) linked by a disulfide bond to 
the receptor-binding BC-snbunits (Af, ^ 100,000). We 
have established the complete sequence of botulinum 
n urotoxin type A (BoNT/A; 1,296 amino acid resi- 
dues* Mr s 149»425) and a partial sequence of botuli- 
num neurotoxin type E (273 amino acid residues) as 
deduced from the corresponding nucleotide sequences 
of the chromosomally located structural genes. The 
promoter of the BoNT/A gene is inactive in Escherichia 
coU. Primer extension experiments indicated that ini- 
tiation of transcription of the BoNT/A gene occurred 
118 nucleotides upstream from the ATG codon. A com- 
parison of the protein sequence revealed an overall 
identity of 33*8% to that of tetanus toxin. No signifi- 
cant similarity to other known proteins including 
ADP-ribosylating toxins could be detected. Three of 
the six histidine residues of the A-subunit of BoNT/A 
were found in the peptide sequence H^'^ELIHXXH^ 
within a domain of predicted a-helical secondary struc- 
ture. This motif is also found in similar positions of the 
A-subunits of tetanus toxin and BoNT/E. 


The Gram-positive spore-forming bacterium Clostridium 
botuUruim proiduces several highly toxic types of botulinum 
neuTOtoxuis (BoNTs)' designated A, B, Cl, Dl, E, F, and G. 
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Today, the number of fatal cases of botulism is fortunately 
small due to improved hygienic precautions during production 
of food. In the past, however, predominantly BoNT/A and 
BoNT/E, the latter mostly found in decayed seafood, were 
identified as the agents causing life-threatening food poison- 
ing of man (for a review, see.Tacket and Rogawski, 1989). 
BoNTs resemble, in regard to biosynthesis and molecular 
architecture, tetanus toxin, whose sequence has alzeadly been 
determined (Eisel et al., 1986; Fairweather and Lyness. 1986). 
The toxins are produced as single-chain polypeptides (Af, s 
150,000) which are, with the exception of BoNT/E, proteo- 
lytically processed into two subunits upon release from the 
organism. The two subunits, designated A- and BC-fragments 
(corresponding to the light and heavy chains), remain cova- 
lently associated by a disulfide bond. Toxification by tetanus 
toxin and BoNTs is initiated by binding of the toxin molecules 
to specific ganglioside receptors, followed by receptor-me- 
diated endocytosis (Black and Dolly. 1986a, 1986b). In con- 
trast to tetanus toxin, which acts preferentially after retro- 
grade axonal transport by blocking the release of inhibitory 
neurotransmitters from central synapses, BoNTs evoke their 
toxic effects in the periphery by poisoning cholinergic syn- 
apses (Simpson, 1989). Recent studies based on the adminis- 
tration of isolated A-fragments from tetanus toxin or from 
BoNT/A to permeabilized bovine chromaffin ceUs have shown 
that the A-fragments alone are sullicient to inhibit exocytosis 
( Ahnert-Hilger et oL, 1989; Bittner et oL, 1989). The molecular 
mechanisms, however, that underlie toxification of central or 
peripheral synapses through clostridial neurotoxins still re- 
main completely unclear. The characterization of BoNT genes 
and the combined in vitro transcription/translations of 
subfragmente thereof could provide usefiil tools for identifying 
cellular components involved in late stages of transmitter 
release. Furthermore, the application of specific DNA probes 
could be of advantage in the quality control of food produc- 
tion. Finally, the genetic development of nontoxic derivatives 
should allow the production of safer vaccines. 

In this study, we present for the first time a complete 
sequence of a botulinum neurotoxin, BoNT/A, as well as part 
of the sequence of BoNT/E. We compare these sequences 
with those of tetanus toxin and partial amino acid sequences 
of other botulinum toxins. We show that a predicted a-helical 
domain within the A-subunit containing 3 histidine residues 
is conserved in tetantis toxin and BoNT/A and BoNT/E. In 
addition, we have mapped th transcription start site of the 
BoNT/A gene. Our data suggest that BoNT/A is translated 
from monocistronic mRNA. 

MATERIALS AND METHODS 

Biosafety Hcguiations — Personnel involved in the isolation and 
cloning of DNA from C. botulinum were immunized with a pentameric 
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vaccine consisting of formaldehyde-treated BoNT/A through BoNT/ 
£. Only DNA fragments smaller than 2.0 kb (1.0 kb for BoNT/E- 
specific DNA) were cloned under L3B1 containment following the 
protocols of Maniatis et al (1982). Toxicity assays were perfonned in 
mice using 200 ^1 of a bacterial lysate obtained from 3X10^ bacteria 
(Boroff and Fleck, 1966). None of the recombinant bacterial clones 
exhibited toxicity. Recombinant DNA was isolated from JM83 and 
sequenced directly using the chain termination method (Sanger et al,, 
1977). 

Bacterial DNA end Ptosmwis— For the isolation of chromosomal 
DNA from C botuiinum (subtype A, strain 62A), overnight precul- 
tures were prepared in 10 ml of brain-heart infusion broth and used 
to inoculate a 190-ml culture. The culture was incubated at 37 'C for 
about 8 h to ODMOnn ^ 0.3. C. botuiinum type E (strain Beluga) was 
grown for 24 h in 500 ml of broth containing 2% (w/v) proteose 
peptone, 0.5% (w/v) glucose, and 0.1% (w/v) cysteine. The cells were 
centrifiiged at 4 *C and 5,000 x g for 15 min and resuspended in 8 ml 
of TE buffer (10 mM Tris/HCl. 25 mM EDTA, pH 8.0) containing 2 
mg/inl lysozyme. The suspension was incubated for 1 h at 37 'C. At 
this time, the EDTA concentration was raised to 100 mM, iand 
Sarkosyl (1% final concentration) and proteinase K (0.1 mg/ml fmal 
concentration) were added. The mixture was extracted twice with 
phenol and once more with chloroform prior to dialysis against three 
changes of 1.000 ml of TE buffer. The mixture was treated for 2 h at 
37 *C with RNase A (0.1 mg/ml final concentration), extracted with 
phenohchloroform (1:1), and precipitated with ethanol. Plasmids 
pSP65 and pSP64 iKrieg and Melton, 1984) were used to establish 
BoNT/A-^^Hc DNA libraries and for in vitro transcription. Esch* 
erichia coU strain HBlOl was used for the amplification of plasmid 
DNA. BoNT/E-specific libraries were constructed in pUCl9 (Yan- 
isch-Perron et al,, 1985) using E. coli strain DH5ir. 

Enzymes and Nucleotides — Restriction endonudeases. T4 DNA 
Hgase, and T4 polynucleotide kinase were from Boehringer Mann- 
heim. Nucleotides and T7 polymerase were from P-L Biochemicals. 
Exonuclease III, Si nuclease, and Klenow polymerase were purchased 
from Promega Biotec. A nick translation system and radiolabeled {a- 
•"PJdCTP O3000 Ci/mmol). |7-*1*]ATP O3000 Ci/mmol), and ^'S- 
dCTP (>800 Ci/mmol) were obtained from Amersham Corp. 

Hybridization Coni/tttbns— Oligonucleotides were synthesized by 
the phosphoramidite method using an Applied Biosystems Model 
380A automated synthesizer. To iden tify B oNT/A-specific sequences, 
a single oligonucleotide (5'-AAACAATTTAATTATAAAGATCC-3') 
was used. This sequence corresponds to Lys'^ to Pro'~ of a partial 
amino acid sequence published previously (Schmidt et al., 1984). 
BoNT/E-specific DNA was screened with a mixture of 29-mer oli- 
gonucleotides (5'-TTTAA(C/T)TATAA(C/T)GATCCTGTTAA. 
TGA(C/T)CG-3') whose sequences corresponded to Phe' to Arg" 
from the partial amino acid sequence reported by Schmidt et al. 
(1985). GeneScreen Plus filters (Du Pont-New England Nuclear) 
were prehybridized in 6 X SSC, 10 X Denhardt*s solution, and 100 ng 
of sheared denatured herring sperm DN A/ml at 43 *C. Hybridizations 
were performed with 10' cpm of the oligonucleotide probe for 24 h at 
43 *C. The filters containing BoNT/A-specific DNA fragments were 
washed twice for 10 min at 43 *C in 6 X SSC, 0.1% SDS. Washing of 
BoNT/E DNA-containing filters was performed at 45 *C in 2 X BSC, 
1% SDS, 0.1% sodium pyrophosphate. Filters were exposed overnight 
to Fuji RX films. Southern analyses involving nick-translated probes 
were performed as described (Maniatis et al, 1982). 

Combined in Vitro Transcription/Translation of BoNT/A-specific 
DNA Fragments-^Piasmid pKN25, harboring the 1.5-kb Sspl frag- 
ment of the BoNT/A gene, was purified by two successive centrifu- 
gation steps on CsCl gradients or by a single passage over a QIAGEN 
column (Diagen, Dusseldorf, Federal Republic of Germany). The 
DNA was linearized downstream from the coding region by digestion 
. with X6al. Transcriptions and translations in a reticulocyte lysate 
(Amersham Corp.) were performed as described (Mayer et ai, 1988), 
using [^SJmethionine (Du Pont-New England Nuclear; 1 mCi/ml 
final concentration) as radiolabeled amino acid. Translation products 
were analyzed by SDS-PAGE on 12% gels. 

Mopping Initiation Site for Transcription— Tola] RNA was isolated 
from C. botuiinum as described (Ausubel et aL, 1987). For primer 
extension experiments, a heptadecamer oligonucleotide whose se- 
quence was complementary to nucleotides -10 to +7 (Fig. 2) was 5'- 
labeled with (7-*T]ATP and applied in the primer extension reaction 
as described (Bensi et al„ 1986) and in the sequencing reaction using 
pKNl3 DNA as a template. 

Anii6odie«— Polyclonal and monoclonal antibodies against form- 
aldehyde-treated BoNT/A were kindly provided by Dr. S. Kozaki and 


Dr. G. Sakaguchi (Osaka Prefecture University, Osaka, Japan). The 
production of polyclonal antibodies has been described previously 
(Kozaki and Sakaguchi, 1975). Toxicity assays were performed in 
mice according to Boroff and Fleck (1966). 

Modification of BoNT/ A with DEPC— Carboxyethylation reactions 
of purified BoNT/A (2.6 nmol) were performed with a 1000-fold 
molar excess of DEPC (ethoxyformic anhydride; Sigma) in 1 ml of 
100 mM sodium phosphate, pH 6.0, essentially as described by 
DasGupta and Rasraussen (1984). The absorption at 240 run was 
monitored using the same amount of toxin in buffer as a blank. DEPC 
was added from a stock solution in ethanol. The blank was treated 
with ethanol only. For calculation of the concentration of N-carbox- 
yhistidine, an extinction coeHlcient of 3200 M'^cm'* was used. 
Reversal of the DEPC modification was attempted with DEPC toxoid 
obtained after a 1-h treatment. The procedure for incubation with 
NH:iOH • HCl was as described previously (DasGupta and Rasmussen, 
1984). In our hands, however, this treatment failed to restore toxicity. 

RESULTS AND DISCUSSION 

Previous studies on the structural gene of tetanus toxin 
revealed the strong bias of Clostridia toward (A + T)-rich 
codons (Eisel et o/., 1986). For this reason, we used oligonu- 
cleotide probes reflecting this specific codon usage. The 
BoNT/A probe specificaUy recognized chromosomal DNA 
fragments of 3.7, 1.40, 1.50, and 0.45 kb as obtained by 
digestion with endonucleases £coRI, £ScoRI/A;uII, 5spl, and 
HindlU, respectively (Fig. lA). DNA migrating in the region 
of the 1.50*kb Sspl band was electroeluted from the agarose 
gel and cloned under L3 biosafety conditions into the Smal 
site of pSP65. pKN2 and pKN25 contained the desired frag- 
ment in opposite orientations. Direct sequencing of pKN2 
and pKN25 and exonuclease Ill-truncated variants of both 
clones revealed that the inserted 1498-ba8e pair Sspl fragment 
encoded the entire A-subunit of BoNT/A. Antisense RNA 
was transcribed from pkN2 and used to screen additional 
DNA libraries. The entire sequence of the BoNT/A gene was 
established from five overlapping clones (see Fig. IB) and 
deleted variants thereof. To further verity the identity of the 
cloned Sspl fragment, we performed combined in vitro tran- 
scription/translations using purified pKN25 DNA. For this 
purpose, no modifications of pKN25 were required since se- 
quencing of the inserted Sspl fragment had revealed that the 
authentic ATG codon for initiation of translation of BoNT/ 
A was indeed the first ATG codon 59 nucleotides downstr am 
from the SP6 promoter. The products obtained in the reticu- 
locyte lysate were analyzed by SDS-PAGE (Pig. IC). The 
majority of the radiolabel was incoiporated into a molecular 
species of about 56 kDa (lane i ), as would be expected from 
the coding capacity of the Sspl fragment. The polypeptide 
was specifically precipitated an A-subunit-specific mono- 
clonal antibody (lanes 3 and 4) and by a polyclonal rabbit 
serum raised against complete BoNT/A toxoid {lane 5), 

The 5'-radiolabeled BoNT/E-specific oligonucleotide hy- 
bridized to a single 9S4-base pair £coRI fragment from chro- 
mosomal type E-specific DNA (data not shown). The local- 
ization of this fragment within the BoNT/E gene is shown in 
Fig. IB. The fragment was cloned into pUCl9 and sequenced 
(Fig. 28). 

Fig. 2A shows a continuous stretch of 4,836 nucleotides 
derived from chromosomal DNA of C. botuiinum type A 
containing a single open reading frame of 3,888 nucleotides. 
The corresponding protein (1,296 amino acid residues) has a 
molecular mass of 149,425 Da. The A -f T c ntent in the 5'- 
noncoding regions of the BoNT/A and BoNT/E genes is 80.4 
and 80.3%, respectively, and thus higher than in the corre- 
sponding coding regions where a total of 73.6 and 72.1% A -f 
T is found. The codon usage of the BoNT/A gene was closely 
related to that of the tetanus toxin gene. A total of 63.4% 
(63.1 % in tetanus toxin) A -H T was found at positi n 1, 70.7% 
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FiC. 1. Identification and organization of atnictural gene 
encoding BoNT/A and BoNT/E. Southern analyses of chio- 
mosoma) DNA from C. botuUnum type A after digestion with £ScolU 
{lane B), Sspl {kne C). EeoRL/Pmll {lane Dh HindlU {lane Eh or 
Xbal {lane F). Lanes A and G show XDNA cleaved with HindlU and 
£ScoRJ///md]I]» respectively. Lanes A'^C' show the results after 
hybridization with the 6'-labeIed oligoniicleotide. B, the coding region 
for BoNT/A containing the A-, B-, and C-subfragmenta is indicated 
by the open boxes. HorisonUdhars and designations indicate individual 
clones used for sequencing. B, Bgdl; E, EcoBl; Hc» Hinelh Hi, 
HindlU: S, Ssph P, PouU. The localization of the 984-ba8e pair 
EcoBl fragment from BoNT/E is included at the bottom. C, in vitw 
transcription/translation of pKN25. The l^kbSspl insert of pKN25 
encodes a polypeptide of 56,3i64 Da that contains the entire A-subunst. . 
5'-Capped mRNA was obtained with SP6 RNA polymerase and 
translated in the presence of ('''^Jmethionine in a reticulocyte lysate. 
Producu were analyzed by SDS-PAGE on a 12% gel (lone /) or 
subjected to immunoprecipitation using preimmune serum {lane 2), 
monoclonal antibody E-2 at 10 tig of purified IgG/ml {lane 3) or at a 
100 iig/ml final concentration ibne 4h or a polyclonal rabbit serum 
{laneSh 


1 1 ) AT0CCATnmMTAAACMmAATTATMA6ATCeTCTAAM«nOT 

HFFVRItqriiTKPPVICVDtA 
<6I )TATATAAAAAmCAAATeCM6*CAAATeCAAaACrAAAAGCrfT1^^ 

TIKI PSACQNQPVKArS'SMP 
1 131 )*A*AttT00CnATTCCACAAAQtfATACAmACAAATCCTC»Ar<yig6*r>mAAAT 

KitfviPctDTrTsrtEcaLii 

I lt|l CC A CO lCD>auyBC A AAACAA6TOattTrtCAWWaCT^^ 

PPPCAKQVPVSYTOSTtLtr 
1 2A1 fCATAAtCAAAAACATAAnAmAAABCGACmCAAAAmTItOlfiMCM^ 

DKCKDVTLB CVT.K LrtI S Y S 
i tOX lACTCATCTTCCAACAATCTTUnAACATCAATACIAACOCCMTACCAll 1 1UM.1 

T D LCtMLLTtlV ■- C I P P U'6 C 
i m l*CTAOU^TACATACAft|ATTAAA»CCTATOAttCTAATWimAiaClCATllC^^ 

STI OTCiKV tSTSCtSV IQ f 
. A3I >CAT0CTACnATACATCAOWtfAACTAATClAeTAATAATA0CACCaCACCiaiTAn 

DCSTISCCiaLVllCffADI 


IQPCCK'Sr««tVL«lTt|iCV- 

I wicccTaACTCMTi>CAnAO^ffiAccGa>caTmAC Aii n m caACfCAcn 

CSTQTlirSPOPTPC.PttSL 

1 601 >CMCTTCATACAAATCCrCTmAG6TCaOCCMATTTCC1ACAb^m^^ 

CVOTKPLLGACK PATDPAVt 
< Ml )n*CC*O^TPUCTTATAC*TCCT6CACAT<SmATAT0CAATABCA>TTAAlCC^ 

tANEll ■ACHRLT.G IAtVPlI 
4 ;?1 >A0CCTTmAAACTAAATACrMTCCCTAmTQUMTC«CT00CTI*OMCTAAC^^ 

■ vrKV STKATVEnSGLtVS P 
• 711 )CACCAAmACAACATTTCC6CCACATCAlOCAAACrrTAXACAtJCm/u:^^ 

CeLBTrCCMOAKP IOSL QEIt 
I «* 1 ICAATTTOCTCTATmAmTAATAACmAMCATATAOCAACTACACTrAATAAACa 

trKLYYVMK PKO l ASYL PKA 
f 901 «AAATCAATACTACCTACTACTCCTTCATTACMC1AYATCAAAAATemTTAAACAGAAA 

rS IVCTTAS LQYilKKVPKKK 
f »TATCTCCTATCTCA*CATACATCTCCAAA A I 1 1 ICCC I i >CAT*AATtAAAATTT6ATAAC 

VLLS t DTSC KP SVDKLK PDK 
iO:i ?mTACAAA»TtfrrAACAC«ATn*CA C ACACCATAATTTTCTTA»CmTTYAftACTA 

L Y K .1 L T e 1 Y T t D JE P V K P P K V 

LSftCTYLiSrDKAVPKIVlVP 
: ! &1 • AMXTAMTYAOCAATAYAYmtCCAYYYAATTYAACAAATACIMYTIAC^^ . 

KVSTT I YDC rSLSltTSLAAS 
1201 «YYTAATevTrAAAATACAC!AAATTAAYAATATBAAtrrTACrAA*CUMAAATnTACT 

rsCCSYClfSSSfTKLKXPT 
' m\ tCaATTernCMYmATAMTTCCWTCIGTAilMSCeAYAATAACYTCTAAA^ 

CrLPCF YKtLCVSC I lYSKTK 
I ».*1 •TCATrACATAAACCATACAATAAtVCAYTAAATCAYTTATCYATCAAACTTAATAATtCC 

>LDKCY;CKAL:S9LC1KrK5k* 

• tCAmcrrmTi^TCmCACAACATAAYTTrACYAATGAYCIAAAYAAACGACAACAA 

OLrrirscoxfYsoiSKCCc 

. 1 UnACATfTCATAaAATA7M?AAC C ACC*CAAC;A A A» T ATYACT?TACATTTAATACAA 

i7SDr:c 1 CAACCxi sldli o 

• I M>1 ^r^TATTATTTAACCTTYMTYTTCAYAATGAACCTGAAAATATYTCAAYACAAAATCTY 

ovvLiriErDiccrcstsiisL 

I M 1 } YCANrrCACATYAYACCCCAArrMAACTUTeOCTAATAIMAMGAineCIMim 

fsoiicQLtLBpaiisrpac 

I UgnAAIuy^CTATatCYTACAYAAATAY ACTAT6TTCaYl Are nC C Y BC l CM (B<MinW*A 

R IETBlOKYTaPe YLSAQtrt 
t till »CATCCYAAAYCTACaiTlCCmAACAAATTCTCTlAA C CAAC C A Y lAYTAAAYCClACT 

1ICK»R 1 A LT V tYS KALLVP I 
f 1 ?Al jeCtCTYTATAC Al 1 1 I I I Itl IC ACACYATCT*A*CAAACTTAAtAAAOCtACCCACCCA 

IVTTPPSt»TVKKYKKATCA 
1 1 101 )CCTATCTTTTTJ>CM. I CU i lAOUCAATTACTATATCAYTYTACCCAtCAAA CY AB CC A A 

AMPLCV VCQ LVYDPTDtYf C 
1 1 161 )CTAACTACTAC06ATAAAATTCCOCATATAACTATAATTAYTCCAtATAYA0GACCICCr 

VfTTDKlAOIYl lIPYl C P A 
f 1921 )nAAATATAOCTAATATCTTAYATMACAYCATTTtCrAeCTCCmAATAnTKAOCA 

LPICRHLYKODPYCALt P » C 
{ l»ll )CCTCTrATfCTmACAATnATACCACACAYTCCAATACCTCTATtAOBlACTrrreCA 

AY 1 L LI P I P C 1 A I P T L C T P A 

LYSY IAHtVLYVQTI D > A L t 
12101 )AAAACAAATGAAAAATGCCATCACCTCTAYAAATATAYACTAACAAATT0CTTA6CAAN6 

KftSCKVD EV YKYI VTSVL AK 
(2U1 )CTTAATACACA»TreAYCTAATA AC AAAAAAAA Y CAAAOytflCr n A r MM TC A*fl C * 

VSTQIOLISStNa;B*L.I>Q* 
t2iai|CAACCAACAAACCCTATAATAAAaATCACttTA<aCMlA«>CI C t CC A »flAg A i AM T 

EAYKAtlNTQTaQTrtllKII 
l2l>l)AATATTAATnTAATATTCAlCAYTTAACTlCCAAACrTAATCACtCU Y AAA T AA^(B a 

aispaiDOLJSKLattteKA 

UJAlMTCATTAATATAAAlAAAnTTTauYeAATPaCfCTYTCAtATrTAAXCAATTeTAR 

HIPiaKPLIIQCST S Y L II « t N 
(2AQI )ATCCCTUTBCTeTYAAA0CeTTAOUieATTnGATCCTACrCTTAAACATCCAYtAm 

IPYCVBSLtOPOASLKOALl 
iPAftDAACTATAYAYATCATAATACAOCAACYTTAATTBCTCAACTACAtACAYTAAAACAYAAA 

KYIYDPBCTLIGQV DftLKDK 
(»2l)CTTAAYAATACACTTACYACAeATATACCTTTTCACCITTCCAAATACCtA6AYAATCM 

VPKTLSTOl PPQLS KYVDIQ 
{2561 lACATTAYTATCTACATrTACTCAAtATATTAACAATATTATTAAXACTTCtATATlCAAr 

RLL8YPTC Y I KSX I BTt 1 L B 
(2Ul)TTAACATATCAAACtAATCATTTAAYACACTUTCrACCtATCCATCAAAAAYAAATATT 

LBYCtNHLt DL S B Y A S K X B 1 
(2701 )OCTACTAAAetAAATTnCATCCAATACAYAAAAATCAAATTCAATTATTTAATTTACAA 

CSKVIIFDPX DKBQ1QLPBLK 
I2T61 )ACrACTAAAATTCAOCTAATTTTAAAAAATCCTATTCTAYATAAYACtATCTATCAAAAT 

SSKlEVILKaAtVTBtnYEB 
(2UnTTTMTACTAlCCrTriQMTAAUATIGCTAACIArTTTAAC«BUTAACTCrMAlMT 

PSTSrVIBlPRTPaSlSLBB 

Fig. 2. Nucleotide and amino acid sequences of BoNT/A 
gene (A) and of BoNT/E-8pecific\£^IU fragment (B). The 
arrowhead marks the initiation site for transcription. The '^lO region 
of the toxin promoter is underlined, Shine-Dalgamo sequences are 
boxed Divergent arrows in the 3'-noncoding region of the BoNT/A 
gene indicate motifs with dyad symmetry. Numbers define the melting 
energy in kilocalories as calculated according to Tinoco et aL (1973). 
Encoded amino acids are in single-letter code. 
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(288l)CA*TATACIUaMTAMTTClATCGUMTmTCMCATOGUUU«TATCAC^^ 

CYTI 1 HCMeVllSCtfKVSLNy 
(394l)0CXCAAATAATCTOG»CmACM»ATACTCACQMiTAAAACMACACTA^ 

GCI IWTLQPTQCXKQaVVFK 
( MOl )TACACTCAAATUTTiUTATATCMAmTATAMCACATGGAn^^ 

TSQMl M ISDTIVRVIFVTIT 
( JO«t lAATAATACATTAAATAACTCTAAAATnATATAAATCCAACATTAATACATCAAMACCA 

NNR-. RRSKIYIRC RLIDqXP 
(3121)ATTrCAAAmAfiCTAATAtTCATKTACrAATAATATAATCmAAATTACAT6CTTCT 

XSN'. CRIHASRRIHPKIDGC 
( )181 )AaM»TA«CAlACATATATnCCATAAAATATmAATC1TnT«TAACCAATrAMT 

ROTHRTIVIRYFRLPDKCLN 
0241 )CAAAAACAAATCAAACAmATATMTAATCAATCAAATTCACCTATmAAAACACnT 

eXElKDLTDRQSNSCILKDr 
( )30l >TGGGCTCAmmACAATAT6ATMACCATACTATAmTAAAmATATCArCCAAAT 

VCDYLQYOKrYYHLNLYOPN 
( 1361)AAAYATGTCGAT6TAAAYAAT6rAC6TATTACA0CTTATATCYAYCtTAAA6CCCCTACA 

KVVDVRHVCt RCYHYLRCPR 
( 3ft21 )6GTACOCnMTGMTACAAACATTrAmAAATYCAACTTTCrATA0OC6GACAAAATn 

CRVRTTNIYIIISSLYRCYKF 
(3«8l)AYTATAAAAAAATATCCT1t:TCCAAATAAACATAATAnCTTACAAATAATCAT0CTCTA 

XIRK YASCNRDKIVRNKDBV 
( IMl )TATATTAATC:ACrACTTAAAAAYAAACAATATACCTTACCTACTAATCCATCACACCCA 

YIRVVVRRKCYRLATNASQA 
( 3601 )GCCCTACAAAaAATACTAACTCCAYTACAAATaCCTCaTCTACCaaATCTAACTCAACTA 

CVCCI LSALIIPOVCftLSQV 
( 3661 }CTACTAATCAACTCAAAAAATCAYCAAGCAATAACAAATAAATCCAAAATCAATTrACAA 

VVNKSRKDQCITRKCKMKLQ 
(3721 )CATAATAATCCCAATCATATACCCmATACGATTTCATCACTTTAATAATATACCTAM 

DRNCIlDXCFICrHQreCKlAK 
(3761 )CTACTACCAACTAATTCCTATAATACACAAATACAAACATCTACTACCA L I 1 IMAI I t C 

LVASRVYRRqiCRSSRTLCC 
OIADTCATCOGAATTTATTCCTCTACATCATOCATOOCCACAAACCCCACT t<«ttaatecr« 

StfCri PVODCVCERPL* 
()901)«actacac(S|;tctftcaaRMtcctecgcaMc«ccc«ta«Matctc«uarcMC«t 
(3961>|cce««|iaata«ccasac«cft«tcat:ftC(^«et^ccrcgcc«aBta^c«ggta«« 

< e.« > 

(402l>aa«at«Maaitaa8#c«r(acu<e:cactcesat«cretaceMaftCca|(«fcciir« 


(A08l)accttccctii«ccctct(ttiacc»taMaeictattt«eceAtcaact|trMfittrr« 
(Al4l)«ctagtCAaajet«t^ascccccCaaKata«cacactretf«tttt«CMtttrrrMa 
(A}01)acccC£ecat<cgaccat(«CcaaCacatct«ccaartcgagacatactttRaKttgi|ii 
(4261)ct»CCCeact««gttc«eecttgaaagatctacctgtatattga«aacrgcl«rrarig; 
(*32 1 )gaaaaAgCggac cagcateaggac tggaggCMctgccc t«tc«M||;t 1 1 rajMcarM 
1 4 381 )SB*cgt csct a c ccRAt c tcccMtca»c«u»utA« gr car f ai ratgCMarr«a 

(44 2 OltaCtVCACtCASf tMgeC K ( tgCtCettC Bf t MAC 

B 

( • 228 )ga«t(caagt«gta|;atajii«a«a«t«jii grc-Acaf,*: : ; f ta« : : 
(•t60}«ataacgatat acetate teca«etgtttaacctca«rt(atMaf««T HI AMI nVtiit: 
f 0 120)1 tgtecaCaaaa«atc«agattacaattgggttatacgl|»lcttAaica(gatfcftM»« 
< * 60 ) aaaaagt c a c a c c ca c ggaca t taaaaaa c a tataaa I c c a««a t igUS]^ gr I « ; 
( 1 ) ATCCCAAAAATTAATACrmAAnATAATCATCCTCTrAATCATAr^CAArrrrATAT 
HPKIRSFRYRDFVSORT I LV 
( 6 DATTAAA C CACC aX TTCTCAACAATrTTATAAATCATTTAATATTATCAAAAATATTTiy 
IKPCCCQEPYKSFKINKKIi: 
( 1 3 1 ) ATAATTCCAGAGACAAATCTAATTGCTACAACCCCCCAACATTTTCATCCCCCTACTTCA 

IIPCRRVICTYPQOFHPPTS 
(l81>YTAAAAAATGCACAYACritCTTAnAnMCCTAATTATrTACAAAirrMTCAAi;^^ 
LRRCOSSYVDPRVLQSDECK 
(24])GATAieATrnTAAAAATACTCACAAAAAYAYTTAATACAATAAAYAAYAAYCmCA06A 

DRFLRIVTRI FRRlllllRLSC 
(301 )G0QiTmATTMAACAACrCTC4AAACCYAAY0MTAmACGCMiaiTAATACTCCA 
CI LLIELSKAHPYLCIfDRYP 
(36l)GAYAA1CAATTGCATATrOCTCATCCATCAOCAfinCACAnAAATTCTCAAATCeTA6C 
DRQFHICDASAVC.I RFSRCS 
(421) CAAfUXATACTATTACCTAATCTTAn ATAATCCCACCACACCCTCAYnATTTCAAACT 

QOlLLPilVI IKCACPOLFET 
(481 )AACACT7CCAATATrTCTCTAACAAATAAYYATATCCCAA(XAATCACCCrrnCGAYCA 
HSSttlSLBRNYKPSRHRrcS 

(Ml) ayacctatactaacattctcacctcaaTayycttttacatttaatcataatactatcaat 

1 AIVYPS PEYSFR PRON SHN 
( 60 1 ) CAATYTA YXCAAGATCCTCCYCYYACATTAATCCATCAATTAAYACATYCATYArATCGA 

EFXQOPALYLflHELlRSLHr. 
(661 )CTAYATCCCCCTAAAC66ArrACTAGAAACYATACTATAACACAAAAACAAAATCCCCTA 

LYCAKCIYYKYYIYQKQRPI. 
( 731 ) AYAACAAAYATAA6ACGTACAAAYAYT6AAC4ATYC 

lYRIRCTRIKEP 

FiC. 2— continued 


(69.4%) at position 2, and 86.7% (83.7%) at position 3. 

We have reported previously that the promoter controlling 
the tetanus toxin gene is located on plasmid pE88 immediately 
upstream from the coding region (Eisel et at., 1986). Tran- 
scription was shpwn to start 127 nucleotides upstream from 
the ATG codon for translation (Niemann et cd., 1988). To 
determine whether the chromosomally located BoNT/A gene 
was part of an operon or whether it was transcribed into a 
monocistronic mRNA» we mapped the transcription start site 
by primer extension experiments using clostridial mRNA as 
a template. The results presented in Fig. ZA demonstrate that 
transcription begins 118 nucleotides upstream from the ATG 
start codon. Fig. 3B shows an alignment of the putative 
promoter regions of the BoNT/A and tetanus toxin promoters 
in comparison to a consensus sequence compiled from 29 
different promoters from Gram^positive bacteria (Graves and 
Rabinowitz, 1986) and the lac promoter. In general, a precise 
allocation of the -35 and -10 regions is difficult within DNA 
sequences containing about 80% A -f T. Therefore, the as- 
signment of these regions in Fig. 3B is tentative only. The 


bot a -as -10 -s 

\ 

ATTTCCAATTOTTTAA CCCTAT CTTATXACGCTAAATATA TATGTT TATC7ATCAAA 
•••a 

t«t -35 -10 -5 ♦! 

tu 

TAAAAATTTAAArrr TCACTT TACAAAAXATAACCTCAT TATGTT ATATCTAATTC 
••a M 

Cr«»- -35 -10 -5 +1 

T4-AAAAA TTCACA «— A R-T-TC- TATAAT AAtAt— 

Imc -35 -10 -5 *l 

CCCACCCCACWC TTTACA CTTTATGCTTCCGCCTCG TATGTT CTCTC6AA 

Fig. 3. Characterization of promoter of BoNT/A ff«ne. A. 
mapping of transcription start site. mRNA was isolated from C. 
botulinum type A. The oligonucleotide 5'-ATGGCATATTTAA- 
CACC-3' complementary to nucleotides *10 to ^1 of the BoNT/A 
gene was 5'*radiolabeled and used in a primer extension experiment 
using reverse transcriptase and for direct sequencing with T7 DNA 
polymerase. The products of both reactions were analyzed on the 
same gel. For the detection of the reverse transcribed product {Ime 
P), however, the gel was exposed 3-fold longer (3 d^ys). B, comparison 
of promoter regions. The putative *35 and -10 regions are listed for 
the BoNT/A ibot a) and tetanus toxin {tet) genes and compared with 
a consensus promoter sequence established from 29 different pro- 
moter sequences from Gram-positive bacteria (Graves and Rabino- 
witz, 1986) and the lac promoter. Vertical arrows in^cate initiation 
sites for transcription. The promoter of the tetanus toxin gene has 
been characterized previously (Niemann et of., 1988). For further 
details, see the text. 

numeric -35 regions of the toxin promoters show almost no 
similarity to the Gram-positive consensus promoter. More 
closely related -35 motifs are indicated by the asterisks. These 
regions, however, would be separated from the corresponding 
Pribnow boxes by 22 and 21 nucleotides, a distance nly 
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BeVr/ft 


90 
90 

Bon/B Bi 

Beay/ft uo 

Yir 120 
BsaV/B uc 

Boiy/ft 149 
TIT 190 
Bavr/S 342 

Boar/ft 114 
yn ISO 
Boay/B lyt 

Beay/I 200 

yiy 210 
Boay/i 199 

Beay/ft 2W 

yn 240 

Ba«y/B 219 

BG»y/ft 240 
TB 249 
BolT/B 149 

Boar/ft 290 

m 299 
Baay/ft 120 

ny 929 

Bevy/ft aso 
m 199 

Bo«r/ft XV9 

yn IB9 

Bear/ft 409 
yiy 419 


e9TBbrBBtyByDb«BBbbT9XVBctprBe 
TSTibraiXBBBfftOBftbbBBixaftipybc 
XTTBiraBiaaabBCCibLKtbSBftapybc 

cs y I 0 y B b B y 1 0 y a c X a V I 0 B D 0 B y 

asyBbbBBPoyasaBPfpabbBODPseftyy 
BOB y p D a « P B X 0 a A S ft V B X E p 9 a c t 0 

BBBBb abVXIOPIftBIXQPBCBSrCBfVb 

BBftBbTabixpopcpybaaaEfBcxybBfD 
p X b b p a y X X B 0 ft B p p b p B y a 9 9 a x 9 b B a a 

BbT B a 6 y C 9 y Q y X B P 9 P B P T P C P B B 9 

aBarppCBDCPcsiaQBftPCPBVTpypaaT 
TBPsa BB p«sxftxyTP9PBysp9PaBa 

icvoTaPbbCftCBPftyBPAVTbftBBbXBftc 
xtaiy9byicBsiTro8PftLbi.aBBbiBTL 

IBBBPIODPftLYLBBIbiatL 

BBbyCXfttaPBIVpIfBTBftTYBBICLBVS 
BGbYORQV S t 9 B I X P 9 I 0 I X y a 0 a Y y P X I 
BCbY6AB4XVYBYyiYQK0VPLIYVtBCYa 

PBS.«iyp8saBA8PxasbOBaBPBbyyyaB 
ftiBi*PTPe«oaftabX9XDXBBabyBBTbao 

IB BP 

rioiftBTbaBftiBiyoTTftBboyaiavPBB 

y.BftXftaBbS«TT9CBDPaiBIB9TBOXy00 


BTOPPBaBacoyxvaBOBPOibvaBxaycF 


VBXBbOBSPa.XBTBbBypBaaaBPVBXpab 


bBBTXTBaTSOPBXBBBDbBBCyBGOaaftV 


aaBapTBbBapycbPBpyBbbfi 
ayaftPBayac 9 c b v 9 b b x o.bQ 


VBeXXTBB 
^BBIIPPVa 


Beay/Vl499 YBB bOBOTaS 


ft b a a b©x lyaavoLfr 


Beay/9Pi 


XBBBbTaBTftSbTObCCt L©X KXiatDLVr 

• ft p c XII D V a a K D 


B 9 X 
B 9 X 


b r p 

XBXBBCBbPP 

iBxaaoBbPP 


Boai/B 
Beay/B 


9oaV/A 

m 


BOBY/A 


Boay/A 

yn . 


BYB 


BPBBaaPTaDbaaoBSXYBaTaxBAftBBax 
xftBBaaPBBBProBcxysyaTBaBPbapay 

9 a B a X V 

VAlBBXyaD 

SbDbX00yyi>YPaPDBSPBaiSIBab89BX 

tbOEiivay a b 0 9 B I T b p a 0 ft y T p V y B « I 
xcobEbBpaiBBppvcBBYBbDByyarayb 

PYAPXYBSBAA 9 T X X Z B B I D D > Y I V 0 Y L 

9SS ■AeBrBBCBBBXAbTBSTaXAbbaPSBTYY 
Ml yAQBiPTYb«iITBYB9V0DAbta9YIXYt 


499 
909 


939 
917 


Bo«Y/A 
YIT 


Bdry/A 
yn 


S49 

919 


lis 

424 


PPSADVVBByaiAYBAABrbOVVSQLyyor 

yppi vxsBvaocAOGXbPbQvvBpixoDr 


YDXYBXYByYDXXABIYIIlPYZGPAba] 
YaXABOXTTIOXXBDVBYXVPyiGPAbB] 


•eVY/A 449 aBbYBPDPVCAbXPSGAVXbbBPIPBlAIP 
yn IS4 BQGYXGaPIOAbKTYGTTbLbKYlPEIYbP 


Ba»Y/A 

yn 


VbCYPAbVByXAaXVbYTOYIDBAbSIiaC 
VXAAbBXABtlYQXBBIlIYIOBPbCEIYC 


BoBY/A 709 XaPXVYBTXyYaVbAXyiVOIDlllEIBII 


yn 


7X4 VVIXVYBbVBABVbGYYBYQrOKiSYOBYl 


9<i«Y/A 
TtT 


711 
744 


AlXaQAXAYXAIXaYQYBOYYCIXIBaXBr 
S:.XYOyDAXSXXXPyKYKXTiCP0IB0IAP 


Boay/A 74S BIOPbfBXbaXBXaXABiaXBIPbBOCATS 

TIT 714 XXBBbXaKbXXXABXABiaXBIPBIIIBSI 

Boay/A 79} YbBaSBXpyGVKIbXOrOASLlDAbbiyi 

TIT 104 PbVaOBXaXAKKObbXrOYOtlBILBOVXB 

•oVY/A 

TTT 


•24 VpaBCYbXGQVOBbKPKyaaVllTBIPrOl 

t>4 AasxrXGXYXbxxbxiKxaiYrsYPiprsT 


FkG. 4. Alignment of sequences of clostridial neurotoxins. 
Conser^ cysteine residues are circled. Sequences weie established 
in this paper (a) or were from Eisel et al, (1986) ib), DasGupU and 
Datta (1988) (c), Fujii et oL (1988) (</), and Gimenez et al (1988) (e). 
TET, tetanus toxin; BUTYR, BoNT/E (from Chstridium butyricum, 
Gimenez etaL 1988). * 
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B p B b p e®B pTBBYxaiByrabrPBBbasBB 
y b B b p bQb aaaovTsxoxPBxrcBftbar bi 

XBPbYoaoBasoxbBPpaopybOTPi'v yb 
XBSbyTsybBXTrbBBpaoppbaypTBT tb 

bBbypPBBYVPVBBTOXBBTBTbBCPB C BY 

ipYftxxBs p y 0 b B a X T a T a y b T a ft p a T T 

BTY a X y b a 9 S b y B 6 T I P t X B B V ft B 0 a K P 
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TBBXbBftbXXP PVOabBQTTVaBBBBOOO 

paabBBibBvcyaftPcxpbyBBBiftVBbBP 
ITBICBBB b 0 p a 8 c a p s « r I C P 8 0 f ■ 
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XPVppGVOCRPb 
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Fig. 4— continued 

seldoxnly observed in Grain-positive promoters (Graves and 
Rabinowitz, 1986). The -10 regions of the BoNT/A and 
tetanus toxin promoters were identical to that of the lac 
promoter. However, in contrast to the tetanus toxin promoter 
that had been shown to be active in £L coU (Eisel et oL, 1986) 
and exhibit about one-third of the strength of the tac promoter 
(Niemann et aL, 1988), the promoter of the BoNT/A gene 
was apparently inactive in E. coli since neither toxin-specific 
peptides, nor transcripts could be demonstrated in pKNlS- 
transfected H BlOl c ells (data not shown). Interestingly, the 
-36 motif (6'-TTTACA-3'), characteristic of the lac pro- 
rooter, was also found in the DNA sequence of the tetanus 
toxin gene {underlined in Fig. 3B). Site-directed mutagenesis 
of this latter motif will provide information on whether the 
observed similarities are indeed important for the activity of 
the tetanus toxin promoter in E. colL Taking into account 
that the 3'-noncoding region of the BoNT/A gene contains 
two regions with dyad symmetry (uredeW^ied by divergent . 
arrows in Fig. 2A) which are characterized by tree energy 
values (Tinoco et oL, 1973), dG(25 'C), of up to -17 kcal and 
which therefore probably constitute transcriptional barriers, 
we postulate that BoNT/A, like tetanus toidn, is translated 
from monocistronic mRNA. 

A comparison of the complete amino acid sequence of 
BoNT/A with that of tetanus toxin and tiie partial sequence 
deduced from the BoNT/E-specific EcoRI fragment is shown 
in Fig. 4. The amino acid composition of BoNT/A closely 
resembles constituent analyses performed on purified A- and 
BC-subunits (Sathyamoorthy and DasGupta, 1985). Further- 
more, the sequence is in total agreement with: (i) a partial 
nucleotide sequence btained f r the NHrterminai region of 
the A-subunit (Betley et oL, 1989); (ii) the sequence of the 
NH2 terminus of the BC-subunit (Sathyamoorthy et oL, 1986) 
as determined by Edman degradation; and (iii) an internal 
peptide sequence from the BC-subunit containing Ile"^ to 
Leu'«V 

' B. R. DasGupta, persona] communication. 
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The comparison of the two sequences newly established in 
this study with th sequence of tetanus toxin reveals several 
interesting features, (i) There is an overall 33.8% identity 
between BoNT/A and tetanus toxin at the amino acid level 
(30.8% within the A-subunit and 35.4% within the BC-sub- 
unit). Although these values increase to 40.5% for the A- 
subunit and 48.1% for the BC-subunit when conservative 
exchanges such as Arg to Lys, Glu to Asp» etc, are neglected, 
the identity is much smaller than originally anticipated (Eisel 
et al,, 1986). The identity between the BoNT/E-specific 
subfragment (containing 273 amino acid residues) and the 
corresponding domain of tetanus toxin amounts to 30.4% 
(41.4% excluding conserved exchanges) as compared to 38.8% 
(45.8%) identity between the same domains of BoNT/E and 
BoNT/A. The identity between tetanus toxin and BoNT/A 
in this region is 33.3% (42.1%). 

(ii) From Thr^ to Phe™" of the B-fragment, the identity 
reaches 47.9% (61.8% including conservative exchanges). This 
domain of the tetanus toxin molecule has been shown to form 
channels in asolectin vesicles at a pH below 5.0 (Roa and 
Boquet, 1985) and one may speculate that this domain pro- 
vides the machinery for translocation of the individual A- 
subunits into the cytosol 

(iii) The alignment of the sequences shown in Fig. 4 yields 
4 conserved cysteine residues (involving Cys**', Cys^, Cys'**", 
and Cys''^* of BoNT/A), suggesting that these residues may 
be involved in disulfide bridges. Indeed, the former 2 cysteine 
residues participate in the disulfide bond between the A- and 
BC-subunits. Recent data by Krieglstein et al. (1990) indicate 
that the internal disulfide bridge present in the C-fragment 

f tetanus toxin involves Cys*^* and Cys"*". No such small 
loop structure could be formed in BoNT/A. At present, how- 
ever, no data are available that would indicate whether the 
internal disulfide bridge present within the C-fragment of 
tetanus toxin is important for binding to specific ganglioside 
receptors. In addition, the corresponding domain of BoNT/A 
has not been analyzed for the presence of internal disulfide 
bonds. 

(iv) The A-subunit of BoNT/A contains 6 histidine resi- 
dues, 3 of which (His^, His^', and His'^) are conserved in 
tetanus toxin and BoNT/E. These histidine residues appear 
in a short peptide sequence which has a predicted or-helical 
structure according to the rules of Chou and Fasman (1978). 
Due to the specific localization of the 3 histidine residues 
within the helix, the imidazole side groups of the histidine 
residues should appear at the same face of the helix winding, 
allowing these side groups to interact with each other or with 
an as yet undefined cellular ligand. DEPC, a reagent specific 
for histidine, lysine, and serine residues, was shown to detox- 
ify BoNT/E and, to a lesser extent, also BoNT/A (DasGupta 
and Rasmussen, 1984). In our experiments, treatment of 
BoNT/A with a 1000-fold molar excess of DEPC led to 
complete detoxification after the incorporation of 30 mol of 
reagent/mol of protein. Although the immunological proper- 
ties of BoNT/A were not altered by such treatment, as as- 
sessed by Ouchterlony gel diffusion assays, toxicity could not 
be restored by subsequent treatment with hydroxylamine 
(data not shown). This indicates that other residues in addi- 
tion to the histidines are irreversibly modified by DEPC. Site- 
directed mutagenesis of the conserved histidines within the 
a-helical sequences in the three neurotoxins should reveal 


whether this motif contributes to the biol gical function of 
the neurotoxins. 
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